Hawkes model for price and trades high-frequency 

dynamics 



We introduce a multivariate Hawkes process that accounts for the 
dynamics of market prices through the impact of market order arrivals at 
microstructural level. Our model is a point process mainly characterized 
by 4 kernels associated with respectively the trade arrival self-excitation, 
the price changes mean reversion the impact of trade arrivals on price 
variations and the feedback of price changes on trading activity. It allows 
one to account for both stylized facts of market prices microstructure 
(including random time arrival of price moves, discrete price grid, high 
frequency mean reversion, correlation functions behavior at various time 
scales) and the stylized facts of market impact (mainly the concave-square- 
root-like/relaxation characteristic shape of the market impact of a meta- 
order). Moreover, it allows one to estimate the entire market impact 
profile from anonymous market data. We show that these kernels can be 
estimated from the empirical conditional mean intensities. We provide 
numerical examples, application to real data and comparisons to former 
approaches. 

1 Introduction 

Market impact modeling (i.e, the influence of market orders on forthcoming 
prices) is a longstanding problem in market microstructure literature and is 
obviously of great interest for practitioners (see e.g., [6] for a recent review). 
Even if there are various ways to define and estimate the impact associated 
with an order or a meta-ordei0, a large number of empirical results have been 
obtained recently. The theory of market price formation and the relationship 
between the order flow and price changes has made significant progress during 
the last decade [7]. Many empirical studies have provided evidence that the 
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price impact has, to many respects, some universal properties and is the main 
source of price variations. This corroborates the picture of an "endogenous" 
nature of price fluctuations that contrasts with the classical scenario according 
to which an "exogenous" flow of information leads the prices towards a "true" 
fondamental value 0- 

We will not review in details all these results but simply recall the point of 
view of Bouchaud et al [6] . These authors propose a model of price fluctuations 
by generalizing Kyle's pioneering approach [18j according to which the price is 
written (up to a noise term) as the result of the impact of all trades. If n stands 
for the trading time, Bouchaud et al. model is written as follows [HI [7]: 



where rjj is a white noise while = ejf{vj) with ej = +1 (resp. Sj = —1) if a 
trade occurs at the best ask (resp. at best bid) and f{vj) describes the volume 
dependence of a single trade impact. The function G{n — j) accounts for the 
temporal dependence of a market order impact. 

Even if models like ([!]) represent a real breakthrough in the understanding of 
price dynamics they have many drawbacks. First, the nature and the status of 
the noise rjj is not well defined. More importantly, these models involve discrete 
events (through the trading or event time) only defined at a microstructural 
level though they intend to represent some coarse version of prices (indeed, the 
price pn in the previous equation can take arbitrary continuous values, more- 
over, calibrating its volatility at any scale involves some additional parameter, 
...). Moreover, these models cannot account for real time (i.e "physical time") 
dynamics or real time aggregation properties of price fiuctuations. In that re- 
spect, they are not that easy to be used in real high frequency applications 
such as optimized execution. To make it short, though being defined at finest 
time scales, they cannot account for the main microstructure properties of price 
variations related to their discrete nature: prices live on tick grids and jump at 
discrete random times. 

Our aim in this paper is mainly to define continuous time version of the 
market impact price model discussed previously. For that purpose, point pro- 
cesses [9] provide a natural framework. Let us not that point processes have 
been involved in many studies in high frequency finance from the famous 'zero- 
intelligence" order-book models [211 E] to models for trade [THl H] or book 
events [HI [10] irregular arrivals. In a recent series of papers [3 [2 [1], we have 
shown that self-excited point (Hawkes) processes can be pertinent to model the 
microstructure of the price and in particular to reproduce the shape of the sig- 
nature plot and the Epps effect. Our goal is to extend this framework in order 
to account for the market impact of market orders. In that respect, the main 
ideas proposed in refs. [U [131 [IS] can be reconsidered within the more realistic 
framework of point processes where correlations and impact are interpreted as 
cross and self excitations mechanisms acting on the conditional intensities of 
Poisson processes. This allows us to make a step towards the definition of a 
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faithful model of price microstructure that accounts for most recent empirical 
findings on the liquidity dynamical properties and to uncover new features. 

The paper is organized as follows. In Section [5] we show how market order 
impact can be naturally accounted within the class of multivariate Hawkes pro- 
cesses. Our main model for price microstructure and market impact is presented 
and its stability is studied. Numerical simulations of the model are presented. 
In Section [3l the microstructure of price and market order flows are studied 
through the covariance matrix of the process. Our results are illustrated on nu- 
merical simulations. An extension of the model that accounts for labeled agents 
is defined in Section |4] and results on market impact are presented in Section 
[5] including an explanation on how the newly defined framework allows one to 
estimate the market impact profile from non labeled data. Section [6] shows how 
kernels defining the dynamics of trade occurrences and price variations can be 
nonparametrically estimated. All the theoretical results obtained in the previ- 
ous Sections are illustrated in Section [7] when applied on various high frequency 
future data. It allows one to reveal the different dynamics involved in price 
movements, market order flows and market impact. Intraday seasonalities are 
shown to be taken care in a particularly simple way. We also discuss, on a 
semi-qualitative ground, how the market efficiency can be compatible with the 
observed long-range correlation in supply and demand without any parameter 
adjustment. Conclusion and prospects are reported in Section |8] while technical 
computations are provided in Appendices. 

2 Hawkes based model for market microstructure 
2.1 Definition of the model 

As recalled in the introduction, in order to define a realistic microstructure 
price model while accounting for the impact of market orders, the framework 
of multivariate self-excited point processes is well suited. A natural approach 
is to associate a point process to each set of events one wants to describe. We 
choose to consider all market order events and all mid-price change events. 

Let us point out that we will not take into account the volumes associated 
to each market orders. Though this can be basically done within the framework 
of marked the point processes, it would necessitate cumbersome notations and 
make the estimation much more difficult. This issue be discussed briefly in 
Section [8] and addressed in a forthcoming work. 

2.1.1 Market orders and price changes as a 4 dimensional point pro- 



The arrivals of the market orders are represented by a two dimensional counting 
process 



cess 




(2) 
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representing cumulated number of market orders arrived before time t at the 
best ask (T^) and at the best bid {T^)- Each time there is a market order, 
either T+ or jumps up by 1. We suppose that the trade processH T is a 
counting process that is fully defined by 

• A^t the conditional intensity vector of the process Tt at t. 

The price is represented in the same way. Let Xt represents a proxy of the 
price at high-frequency (e.g., mid-price). As in refs. [3j[2l[T] we write 



Xt = N+ - N[- 



(3) 



where Nj^ (resp. Nj^) represents the number of upward (resp. downward) price 
jumps at time t[f|. Thus, each time the price goes up (resp. down), (resp. 
N^) jumps up by 1. We set 



(4) 



As for the trade process, Nt is fully defined by 

• t the conditional intensity vector of the process Nt at t. 
The 4 dimensional counting process is then naturally defined as 



Pt 



Nt 
V I 

as well as its associated conditional intensity vector 



Tt 
Nt 



(5) 



At = 



(6) 



2.1.2 The Model 

Basically, the model consists in considering that the 4-dimensional counting 
process Pt is a Hawkes process [T5] . The structure of a Hawkes process 
allows one to take into account the influence of any component in Pt on any 

■^In the following, a "trade" will refer to the execution of a given market order (which 
might involve several counterparts). Thus the process T will be referred indifferently to as 
the market order arrival process or the trade arrival process. 

^Let us point out that, in this model, we do not take into account the size of the upward 
or downward jumps of the price. We just take into account the direction of the price move, 
(reps. —1) for any upward (reps, downward) jumps 
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component of At . In its general form the model is represented by the following 
equatioifl 

\t^M + ^*dPt. (7) 

where $f is a 4 x 4 matrix whose elements are causal positive functions (by 
causal we mean functions supported by 11+). Moreover, we used the "matrix 
convolution" notation, 

B^dPt^ I Bt-sdPs, 

where Bi_gdPg refers to the regular matrix product. M accounts for the exoge- 
nous intensity of the trades, it has the form 



M = 



/ ^ \ 

A* 


V J 



(8) 



since, by symmetry we assume that the exogenous intensity of T+ and are 
equal while mid-price jumps are only caused by the endogenous dynamics. 

The matrix $ or any of its sub-matrices (or element) are often referred to 
as Hawkes kernels. Each element describes the influence of a component over 
another component. Thus, it is natural to decompose the 4x4 kernel $t in four 
2x2 matrices in the following way 

*.^rss) 



where 



• <i>^ (influence of T on A^) : accounts for the trade correlations (e.g., 
splitting, herding, . . . ) . 

• <i>-^ (influence of T on A^) : accounts for the impact of a single trade on 
the price 

• <i>^ (influence of on A^) : accounts for the influence of past changes in 
price on future changes in price (due to cancel and limit orders only, since 
changes in price due to market orders are explicitly taken into account by 

• (influence of iV on A"^) : accounts for feedback influence of the price 
moves on the trades. 

If we account for the obvious symmetries between bid-ask sides for trades and 
up-down directions for price jumps these matrices are naturally written as: 



*Let us point out that Section [4] will introduce a generalization of this model including 
labeled trades. 
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where all 4>i ' ' are causal functions and the upperscripts s and c stand for "self" 
and "cross" influences of the Poisson rates (we use the same convention which 
was initially introduced in [3] for (t>'^). Thus, for instance, on the one hand, 0^'*' 
accounts for the influence of the past buying (resp. selling) market orders on 
the intensity of the future upward (reps, downward) price jumps. On the other 
hand, (j)^''^ quantifies the influence of the past buying (resp. selling) market 
orders on the intensity of the future downward (resp. upward) price jumps. 

Remark 1 : All these 2x2 matrices commute since they diagonalize in 
the same basis (independently of t). Their eigenvalues are the sum (resp. the 
difference) of the self term with the cross term. This property will be used all 
along the paper. Most of the computations will be made after diagonalizing all 
the matrices. 



2.1.3 The Impulsive impact kernel model or how to deal with simul- 
taneous jumps in the price and trade processes 

It is important to point out that a buying market order that eats up the whole 
volume sitting at best ask results in an "instantaneous" change in the mid-price. 
From our model point of view, it would mean that T and N have simultaneous 
jumps with a non zero probability. It is clearly not allowed as is by the model. 
However, from a numerical point of view, this can be simulated by just consid- 
ering that the jump in the price takes place within a very small time interval 
(e.g., of width 1ms which is the resolution level of our data) after the market 
order has arrived. There is no ambiguity in the "direction" of the causality : it 
is a market order that makes the price change and not the other way around. 
This would result in an impact kernel (j)^'^ which is "impulsive", i.e., localized 
around 0, actually close to a Dirac distribution 5t. 

From a practical numerical point of view, choosing 0^'* to be a Dirac dis- 
tribution is fairly easy. It basically amounts to considering that it is a positive 
function of given norm / and with a support of the order of a few mil- 
liseconds. Let us point out that it means that, the price increment between 
the moment of the trade and At milliseconds afterwards, follows a Poisson law 
whose parameter is /. This actually allows price jumps (spread over a few 
milliseconds) of several ticks (greater than 1). 

Of course, from a mathematical point of view, this is not that simple. The 
limit At — > has to be defined properly. This will be rigorously defined and 
extensively discussed in a future work and is out of the scope of this paper. The 
"practical" approach described above and the fact that we can formally replace, 
in all the computations, (j)^'" by a I5t is far enough for our purpose. 

It is clear that we expect to find an impulsive component in 0^'^ when 
estimating on real data. Though, a priori, we do expect also a non singular 
component that could have a large support (e.g., when the marker order eats 
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up only a part of the volume sitting at best ask), we will see in estimations that 
most of the energy of (f)^'" is localized around 0. Moreover, we will find that (/)^'° 
is close to 0. 

All these remarks will lead us to study a particularly interesting case of the 
previously defined model for which 

• (j)!'-' = I6t and cp^''^ = 0. 
Consequently 

• (j)^ = I5tl, where I refers to the identity matrix. 

This model will be referred in the following as the Impulsive Impact Kernel 
model. 

Before moving on and study the conditions for our model to be well defined, 
we need to introduce some notations that will be used all along the paper. 



2.1.4 Notations 

Let us introduce the following notations: 

Notations 1 If ft is a function fz refers to the Laplace transform of this func- 
tion, i.e., 

% = j e-"*ftdt. 
6t refers to the Dirac distribution, consequently 

?z = 1. 

Moreover, we will use the convenient convention (which holds in the Laplace 

domain) : 5fk5t = St. 

The L^ norm of f is referred to as: 

11/11 = / \ft\dt. 

Thus 



if yt, ft>0 then 11/11= J ftdt = fo 



We extend these notations to matrix of functions. Thus, if Ft is a matrix whose 
element are functions of t, let denote the matrix whose elements are the 
Laplace transform of the elements of Ft . Following this notation, we note 

Notations 2 If M is a matrix, M* refers to the matrix M whose each element 
has been replaced by its conjugate and refers to the hermitian conjugate 
matrix of M. 



8 



Notations 3 Whenever At is a stationary process, we will use the notation 

A = E{Xt) = 



AT 

V A-y 



where 



A^ = E{X^:) = EiX^;) and A^ = i?(A^") = ^;(A^-) 



Let us point out that the fact that the ± mean intensities are equal is due to the 
symmetries of the kernels in (|10p and (jlip . 

Notations 4 We define the kernel's imbalance ; 

• A(j)T = 0^'^ - and = - ^^■'^ 

• = (/)^'" - ^^'^ and = 0^^'' - 0^^^= 

• = 0^'" - (/.^^^ and A(^^ = 0^^" - 0^'^= 

• A(/)^ = 0^^" - ^^''^ and A4>^ = 0^'^ - 

iet us point out that since the kernels are all positive functions, one has, re- 
placing ? by T , N , I or F : 

= and ^^^^U''<^\\, 

and consequently 

M>l = ^l;^ = U'-^\\-\W^^\\ 

2.2 Stability condition - Stationarity of the price incre- 
ments 

The process Pt defined in Section [2.1.2l is well defined as long as the matrix $t is 
locally integrable on R+. Hawkes, in his original papers [TH[TS], has formalized 
the necessary and sufficient condition for the previously introduced model ([7]) 
to be stable : the matrix made of the norm of the elements of $ should have 
eigenvalues whose modulus are strictly smaller than 1. 

This condition can be expressed in terms of conditions on the norm of the 
different kernels : 

Proposition 1 (Stability Condition) The hawkes process Pt is stable if and 
only if the following condition holds : 

(H) The eigenvalues of the matrix $o have a modulus strictly smaller than 1. 
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In that case, Pt has stationary increments and the process Xt is strictly station- 
ary. Moreover (H) holds if and only if 



c+ < (l-a+)(l-&+) and a+, 6+ < 1, (13) 



where 



• 6+ = (f)^'^ + (pQ^'^ and 

Moreover (|13p implies that 

A0^A0^ - K < (1 - A^)(l - A0^), (14) 

where we used Notations^ 
The proof is in Section 19.11 

Let us point out that in the case there is no feedback of the price jumps on the 
trades, i.e., — (or c"*" =0), then the stabiUty condition ([T^ is equivalent 
to a+ < 1 and b+ < 1, i.e., {{(p^-'W + < 1 and ||(/)^'''|| + < 1. 

The mean intensity vector is given by the foUowing Proposition. 

Proposition 2 (Mean Intensity) We suppose that (H) holds (i.e., ([T^ ). 

Then 

k^E{\t) = {l-%oy^M. (15) 



This can he written as 



and 



A^ 
A^ 



^l{l + Da){l-<^^)v (16) 



^^^^{l + bo)^v (17) 



where v = \ | ) and where Dt is defined by its Laplace transform 



D^ = {{l-^i)il-<^>^)-^i<S>i)-'-l. (18) 



Proof : The proof is basically an adaptation of a proof previously presented in 
[2]. Let the martingale dZt be defined as 

dZt = dPt - Xtdt. 

Using (O, we get 

At = A/ + $ * dPf = Af + $ * + $ * Xtdt. (19) 
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Thus 

((5I-$)*At = M + $*dZt. (20) 

Consequently, 

At = [Stl + *) ★ il/ + {6tl + * $ ★ dZt, (21) 
where 5* is defined by 

=$,(!- 

Taking the expectation, we get (IT5]) . Moreover, we have 

I-^ = { Z ^ „ (22) 

Using Remark 1 a the end of Section 12.1. 2[ one can easily check that 

where Dt is defined by (|18p . The Equations (IT51) and p7)) are direct conse- 
quences of this last equation combined with (jlSp . 

In the following we will always consider that (H) holds, i.e., that (|13p 
holds. 



2.3 Numerical simulations 

In order to perform numerical simulations of Hawkes models, various methods 
have been proposed. We chose to use a thinning algorithm (as proposed, e.g., 
in [22]) that consists in generating on [0,tmax] an homogeneous Poisson process 
with an intensity M > sup^gjQ j^^^j (Af ^, Af^ ). A thinning procedure is then 
applied, each jump being accepted or rejected according to the actual value of 
A^ or Af ^ . In order to illustrate the 4-dimensional process we chose to display 
only the price path 

Xt = N+ - TVr (24) 
and the cumulated trade process path as defined by 

Ut^T+-T-. (25) 

In Figure [1] we show an example of sample paths of both Xt and C/t on a 
few minutes time interval. All the involved kernels are exponentials. Some 
microstructure stylized facts of the price can be clearly identified directly on 
the plot : price moves arrive at random times, price moves on a discrete grid 
and is strongly mean reverting (see beginning of next section). In the large 
time limit, one can show that these processes converge to correlated Brownian 
motions (see [2] or Section [331) . This is illustrated in Fig. [2] where the paths 
are represented over a wider time window (almost 2 hours). As discussed in 
Section [773} since we choose (jr'"''^ — and (p^''' = 0, the small time increments 
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of Ut remain correlated while the price increments correlations almost vanish. 
This can be observed in Fig. [2] where the path of Ut appears to be smoother 
than the path of Nf. 



:(a) 



-A 




200 300 

time (s) 



Figure 1: Example of sample paths for the cumulated trade ([25]) (a) and price 
(|24|) (b) processes of the model with exponential kernels. We chose 0"^''^ = 
^ ^Lc ^ ^R^s ^ 0, = 0.03 e-5*/ioo^ ^N.c ^ 0.05 e-*/io, (f>^-' = 

25 e-i"o* and ch^^'' = 0.1 e-*/^. 



In the next section we study the microstructure properties of both the price 
and the market order flow. 



3 Microstructure of price and market order flow 

The model we introduced above can be seen as a generalization of the price 
microstructure model previously introduced in [3]. Indeed, the 2-dimensional 
model defined in [3] can somewhat be seen as the "projection" on the price 
components Nt of the model defined in Section [2.1.2l Thus, it easy to show that 
it will account for all the price microstructure stylized facts already accounted for 
by the model in [3] . This includes the characteristic decreasing shape of the mean 
signature plot E{{X^t — Xg)'^) / At (which explains why estimating the diffusing 
variance using high frequency quadratic variations leads to a systematic positive 
bias). As explained in [3], this effect is due to the high frequency strong mean 
reversion observed in real prices and can easily be modeled by choosing kernels 
such that ^ We refer the reader to [3] for all the discussions 
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2000 4000 6000 

time (s) 

Figure 2: Example of sample paths for the cumulated trade (f25|) (a) and price 
((24|) (b) processes of the same model as in Fig. [1] over a wider range of time. 
One does not see the discrete nature of time variations anymore. Ut and Xt 
appear as correlated processes. 



concerning the signature plot, the problem of variance estimation using high 
frequency data and the link with mean reversion of price time-series. 

Thus, in this Section, we shall mainly focus one price correlations and market 
order correlations and how they relate one to the other. 



3.1 Price and Trade covariance function 

Following [2] , we define the covariance matrix of the normalized process at scale 
h and lag r by 

V<^^ = h-'CoY {Pt+h+r - Pt+r,Pt+h ~ Pt) , (26) 

where we normalized by h in order to avoid a trivial scale dependence. Let 
us note that, since the increments of Pt are stationary (we suppose that (fTSll 
is satisfied), the previous definition does not depend on t. Thus, it can be 
rewritten as 

4'^ = (^{J^^'^ dPj - Ah)i£ dP, - Ah)j . (27) 
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In [TJ [2] , it is proven that the the Laplace transform of wi' can be expressed as 
a function of the Laplace transform of gi^'^ (where g^.'^^ = (1 ^ 'ft^)"'^) ^^"^ ' 

^i'^) -?W(I-S.)-iS(I-$t)-i (28) 
where E is the diagonal matrix 

From this result, one can easily deduce an analytical formula for the price auto- 
covariance function. 

Proposition 3 (Price auto-covariance) Let Ct'^^'^ he the normalized auto- 
covariance of the price increment : 

fjN,{h) ^ iE{{Xt+h - Xt){Xt+r+h - Xt+r)), (30) 



then 



(1 - A</.^)(1 - A0^) - A0-f A0 

Before proving this Proposition, we first need the following Lemma, which is a 
direct consequence of (|T8l) and (|23)) . 

Lemma 1 

(i-$,)-ii](i-$t)-i = (i+5,)(i+5:)( |f |C ) (32) 



where 



= A^(I - $^)(I - $^)* + A^$^$^* , 

- K^¥{1 - $^)* + A^$^*(I - $^), 

Ei = A^(I - $^)$^* + A^(I - $^)*$^, 

E^ = A^$^$^* + A^(I - $^)(I - $^)*. 

Proof of Proposition : Using the symmetries of all the kernels, and the fact 
that Xt = - , we get 

Thus, if we define df and d^ such that 
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we have 

Using Remark 1, we get that 



(1 - A(/.^)(l - A0^) - A0-f A0-f 



which proves (PT|) . 



Using similar computations, one derives an analytical formula for the auto- 
covariance of the cumulated trade process 

Proposition 4 (Trade auto-covariance) Let Ct'^^^ he the normalized co- 
variance of the increments of the cumulated trade process Ut defined by ([25]) . 



then 



C. 



1 



t+h 



Ut){Ut+r+h - Ut+r)), 
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(1 ^ A<?!>^)(1 - A0^) - A<?i^A0-f 



(33) 
(34) 



3.2 Numerical simulations 

In order to illustrate these results, in Fig. [3] we have plotted both theoretical 
(solid lines) and estimated (•) correlation functions. The sample we used for the 
estimation contains around 300.000 trading events and corresponds to the same 
kernels as the ones used for Fig. [2j One clearly see that empirical estimates 
closely match theoretical expressions. Moreover, one can observe that the trade 
increment autocorrelation has an amplitude that is an order of magnitude larger 
than the price increment covariance. This issue will be addressed in Section [7.3l 



3.3 On the diffusive properties of the model 

Let us point out that we know [2 that a centered d-dimensional Hawkes pro- 
cess Pt diffuses at large scales {h +oo) towards a multidimensional gaussian 
process : 

-^{Pi^t - e{Pm)) (I - ^^r^^^'^wt 

where Wt is a standard d-dimensional Brownian motion and E is defined by (f29|) . 
This is a very general result that can be applied here to obtain the covariance 
matrix of the diffusive limit of our 4-dimensional Hawkes process Pt . Actually, 
if one is just interested in the diffusive variance of Pt, one can easily obtain it 
directly from the definition of the covariance matrix (|26l) (noticing that q'^'' 
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50 100 150 

T (S) 



Figure 3: Empirical and theoretical (From Eqs. (|31|) and (|34|)) autoccorelation 
functions for the process defined in Fig. [2] (a) Autocorrelation of the increments 
of Ut- (b) Autocorrelation of the price increments (i.e., the increments of Xt). 
In both cases, for the sake of clarity with have removed the lag r = 0. 

corresponds to the variance of -^Pht)- One gets that the diffusing variance is 

(I— $o)~^S(I — ^q)^^. It is then straightforward to obtain the diffusive variance 
of the price Xt = N^^ - N^T : 

^_ V2y^A^(A0^)2+A^(l-A0;^)2 
~ (1 - A0^)(1 - A^^) - A0^A0^ 

(Let us point out that shows that the denominator is positive). 

4 Model with labeled agents 
4.1 Accounting for labeled agents 

The trade arrival process T models anonymous trades sent by anonymous agents : 
this corresponds to the typical trade information sent by most exchanges (i.e., 
the trades are unlabeled, there is no way to know whether two different trades 
involves the same agent or not). However, there are many situations where one 
has access to some labeled data corresponding to some specific labeled agents. In 
this case, it could be very interesting to be able to analyze the impact of these 
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labeled trades within the same framework. It is naturally the case of brokers 
who have labels for all their clients (though some clients may have several bro- 
kers, consequently a given broker might not be able to identify all the trades of 
a given client). In general, this is the case of any financial institutions which 
has access to the historic of all its own trades. 

In our model, we chose, to consider only the case of a single labeled agent 
that is sending market orders at some deterministic time. The case where there 
are several such agents is a straightforward generalization. The trades arrival 
of this labeled agent is represented by a 2d deterministic function 

( in ^''^ 

representing the market orders arrival at the best ask and at the best bid 
{A~). Again, (resp. A~) jumps upward by 1 as soon as the agent is buying 
(resp. selling) (we do not take into account the associated volumes). 

In all the following, we consider that the agent is active only on a 
finite positive time interval, i.e, 

3r>0, Support{dAt) C [0,T[ (36) 



We are ready now to reformulate the previously introduced model (see Sec- 
tion [5T|21) taking into account the labeled agent. 

4.2 The model 

In its most general form the model writes 

Xt = M + ^*dPt + e*dAt, (37) 

where is a 4 x 2 matrix. In the same way as we did for we decompose the 
kernel 8 using 2x2 matrices : 

= ( |/ ) (38) 

• (influence of the labeled trades on A"^) : accounts for the herding of 
the anonymous trades with respect to the labeled trades. 

• (influence of the labeled trades on A^) : accounts for the impact of 
a single labeled trade on the price. A priori there is no reason not to 
consider that the impact of a labeled trade is not the same as the impact 
of a anonymous trade, i.e., we will take 

^ $^ (39) 

Let us point out that the model introduced in Section 12.1.21 is a particular case 
(when At — 0) of this model. In that sense, this model is a more general model. 
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5 Market impact in the model with labeled agents 

5.1 Computation of the market impact profile 

Let us give an analytical expression for the market impact profile. 

Proposition 5 (Market impact profile) The expectation of the intensity 
vector is given by 

£:(At) = (a + *)*(M + e*<iAt), (40) 
where 5* is defined by its Laplace transform 

= $,(!- 

The market impact profile between time and time t of the labeled agent is 
defined as 

MIt^E{Xt) = E{N+ ~N,-). (41) 

It can be written 

MIt^iS~A^)i.A(j)^*{A+ -A-), (42) 
where A^t is defined as 

A?..l a-A^^ + Ai^)_ _ 

Proof of It is very similar to the proof of p^ . Let dZt be the martingale 
defined as 

dZt = dPt - Xtdt. 



Using ([ST]), we get 

At = M + e ★ dAt + $ ★ dPt = A/ + e * d^t + $ * YdZt + $ ★ YXtdt. (44) 
which gives 

{Sl~<^)i.Xt= M + Qi.dAt + ^*dZt, (45) 
which is equivalent to 

At = {SI + *) ★ (M + e ★ dAt) + [SI + $) * $t * dZt. (46) 

Taking the expectation, we get (1^0]) . 

Proof of (gSl) 
Using ((23l) . we get 

£;(A^t) = {SI + D) K {M + e * dAt). (47) 
where -fC is the 2x4 matrix defined by 

K ^{ 'P^ M - $^ ) (48) 



18 



Thus 

EidN+^dN-)=ui6I + D)^K^(^ fil^f^ ) (49) 

where u stands for the vector u = (—1, 1). Let us recall that we chose 9^ — . 
Using Remark 1, all the 2x2 matrices involved in this equation are commuting 
since they are symmetric along both diagonals. Thus one gets 

E{dN+ - dN^r ) = u{51 + D) ★ {SI - $^ + e^) ★ ★ dAt (50) 

Thus the market impact is 

MIt = u{SI + £1) ★ ((51 - + e"^) ★ ★ At (51) 

Using Remark 1 again, we deduce pB]) 



Let us state a trivial corollary of this last proposition that will be of partic- 
ular interest in order to study the permanent impact in the following Section: 



Corollary 1 In the case of an impulsive impact kernel (see Section \2.1.3\} and 
— 0, the market impact profile of a single buying market order sent at time 
t = (i.e., dA^ = 6t and dA^ = 0) is given by 



Mlt^l- f A^udu (52) 



where A^t is defined as 



U.-l . ^ (53) 

(1- A(/)^)(l- A0^)-/A0^^ 

In Fig. |4] are displayed 2 examples of market impact profiles computed 
according to Eq. ()42p for a meta-order consisting in buying a constant amount 
of shares during a period T (i.e. dAt — T^^dt ii t <T and dAt — otherwise). 
The parameters Fig. lUJb) correspond to the ones estimated from empirical data 
as discussed in Section [7] (notably Acj)^ ~ 0.9). One can see that Fig. IDJb) 
reproduces fairly well the empirical shape measured using labeled database (as, 
e.g., in [20l[5]): a concave shape of the profile MIt for times t < T (the so-called 
"square- root" law |13| ) and a convex relaxation towards the permanent impact 
when t > T. This shape is qualitatively explaind in Section [TjH 

For illustration purpose, the parameters of Fig. IH^a) are the same as the ones 
of Fig. m^b) except that they have been tweaked in order to get A0^ < 1/2. 
The fact that the asymptotic value of the market impact is larger is explained 
by point iii) of the next section. 
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0.5 1 1.5 2 0.5 1 1.5 

t/T t/T 



Figure 4: Example of two market impact profiles according to Eq. (|42|) with 
= and dAt — T~^lt<Tdt. The kernel shapes and the parameters have 
been chosen to roughly match the values observed empirically in Section [T] The 
two figures only differ by the values of A0^. (a) Acj}"^ = 0.45 (b) A0q — 0.95. 



5.2 Permanent versus non permanent market impact 

One important consequence of (|53p is that the asymptotic market impact of a 
single labeled market buying order at time i = is of the form (assuming an 
impulsive market impact kernel) : 

M/+^ = (l-ACo). (54) 

Thus controlling how permanent is the market impact essentially amounts in 
controlling how large A^o is where 

Afn = 1 ^ ^ (55) 

(l-A0o^)-/A0^^/(l-A</.;f) 

thus 

M/+00 = ^ ^ (56) 

(l-A0^)-/A</-^/(l-A0j) 

Let us note that, the stability condition of the process (see Proposition[T|) implies 
that |1 - A0^| < 1 and |1 - A^| < 1. 

This last equation allows us to identify three different dynamics that can 
lead to a decrease of the permanent impact. They all correspond to different 
ways of introducing mean reversion into the price : 

i) Mean reversion due to microstructure. This case corresponds to the case 
A(f)Q is very negative, i.e., much larger than H^^'^H. However 

this effect is limited by the fact that we know that we cannot go below 
A(j)Q > — 1 due to the stability condition. 
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ii) Feedback cross influence of the price moves on the trades. This case 
corresponds to the case AipQ is very negative, e.g., 1 1(/>^''^| | large and (^^'* = 
0. This is definitely an effect that is present in real life (see Section ^ 
: as the price goes up, traders are sending more and more selling buying 
orders. 

iii) Auto-correlation of the trades . The previous effect i) is even stronger 
if there is a strong correlation in the signs of the market orders (i.e., 
||A0^|| ~ 1). Let us point out that Fig. 0] shows a clear illustration of 
this point : the left market impact curve has a stronger permanent market 
impact than the one of the right, this is due to the fact that it corresponds 
to a smaller ||A^||. 

As we will see in Section [m all these effects are present in real data. And they 
will all play a part in reducing the asymptotic market impact (see Section [7.41) 

5.3 Estimation from non labeled data. Response function 
versus Market impact function 

As already pointed out, most markets do not provide labeled data. The order 
flows are made of anonymous orders sent by anonymous agents. A priori, in 
that case, the only way of quantifying the "impact" a given market order has 
on the price is to estimate numerically the response function Rt- The response 
function is defined as the variation of the price from time to time t knowing 
there was a (e.g., buying) trade at time 0. Thus it can be written 

Rt = E{N+ - N[- \dT+ = 1), for t > 0, (57) 

and Rt ^ otherwise. It can be written as 

Rt^ [ E{dN+\dT+ = 1)- [ E{dN,-\dT+ = 1), for i > 0, 
Jo Jo 

Notice that within the model ([T]) of Bouchaud et al., this response function 
can be explicitly related to the "bare impact" function G{n) and therefore used 
to estimate its shape [Sj. It is important to understand that this function is 
fundamentally different from the market impact profile of a single market order. 
The market impact isolates all the market orders of a single agent (e.g., a meta- 
order) and quantifies what the impact of these market orders are. In order to 
compute it, one, a priori, needs to identify all the market orders of a particular 
agent. This is not the case of the response function which is "polluted" by the 
impact of all the market orders that are in the same meta-order as the market 
order that is under consideration. 

A very important consequence of our model (as it will be explained in Section 
17. 4p is that our model allows estimation of the market impact profile even if no 
labeled data are available. It will basically consist in first estimating all the 
kernels (see Section [5]) and then using the analytical formula P5|) . 
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Let us point out that one can easily obtain an analytical formula for the 
response function using Proposition |6] of Section 19.21 We prove that if gt is the 
4x4 matrix defined by gt = {gl''}i<i.j<4 with 



gt'dt = E{dPl\dP^ = 1) " <^'^5t - A'dt, 



then 



Using Lemma [1] if we define 

Rt^ [ E {dN+ \dT+ = 1)-A^ du and R^t ^ [ E{dN-\dT+ = 1) - A^du, 
Jo Jo 

and if we define dRf — rfdt, dRf — r^dt, then, 

'l^= {l + D){l + b*){E^)* /A^ 
Using Remark 1, we get 

(I - A0^*)A0^ + (A^/A^)(I - A0'^)A0^* 



(1 - IA(j)T){l _ A0W) - A(/.^A0-P 
In the case of an impulsive impact kernel, it gives 



Rt = /(I - / A^,;du), Vt > 
Jo 



where A^^ is defined by 



(1 - A(/.^*) + (A^/A'^^)(l - A0^)A0^'*/ 



iF* T-1 



(1- A?i^)(l- A0^)-/A0-P 



6 Non parametric estimation of the kernel func- 
tions 

In this section we provide a new method to estimate the shape of the kernels 
involved in the definition of the modelfj. A former non parametric estimation 
method has been introduced in [1]. This method relies on the expression psp 
and mainly consists in extracting the square root of the autocorrelation matrix. 
However, in order to do so univocally, one has to suppose that the process is fully 



^Notice that during the completion of this paper, we have been aware of a work by M. 
Kirchner introducing a non parametric estimation method very similar to the one presented 
in this section. Let us point out that he has performed a comprehensive statistical study of 
the main properties of this estimator |17) . 
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Figure 5: Numerical estimation of the Hawkes kernel matrix (f> by solving the 
Fredholm equation ()90|) for an exponential model. Estimations are represented 
by symbols (•) for the "self" kernels and (o) for the "cross" kernels. Exact 
exponential kernels are represented by the solid lines. The model corresponds 
to (jP^'" = 0^'" = (/)-^'^ = (^■"''^ = 0, (t^'" = 0.04 e-*/^ ^^■'^ = 0.02 e-*/^ 
0^^" = 0.02 e-*/2" and (j)^'" = 0.06 e^^/^^. 

symmetric and in particular that T and N have the same laws. This assumption 
is clearly unrealistic, therefore the method of [Ij is not suited to estimate the 
process defined in Scction[21 For that reason we introduce an alternative method 
that does not require any symmetry hypothesis. This method relies on the 
Proposition [3 of Annex |nil(Eq. 

5t = 0*((5I + 5t), Vt>0. 

where gt is the matrix of conditional expectations defined in Eq. (j85p . This 
above equation is a Fredholm equation of 2nd kind. Since gt can be easily 
estimated from empirical data, the matrix (j) can be obtained as a numerical 
solution of the Fredholm system. We thus implemented a classical Nystrom 
method that amounts to approximate the integrals by a quadrature and solve a 
linear system [25]. In order to illustrate the method, we have reported in Fig. 
[5] the estimated kernels in the case when all functions (pt have an exponential 
shape. The sample used has typically 3 10^ trade events and 1.5 10^ price 
change events. One can see that for such sample length, numerical estimates 
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Figure 6: Estimation squared error as a function of the number of events for 
the model of Fig. [5] in log- log scale. The slope is —1. 



and the real kernels are close enough to determine a good fit of the latter. Let 
us note that we have checked that the method is reliable for various examples 
of kernels like exponential, power-laws or constant over an interval. 

In order to investigate the efficiency of the method, one can evaluate the 
estimation error behavior as a function of the number of events. This error 
can be defined as the supremum of the mean square error associated with each 
kernel: if 0Ar^ stands for the estimated kernel matrix for A^g trading events, then 
one can consider the following square error; 



e^(iV,) = S sup| 

A log-log plot of E'^{Ne) as a function of Nf, is reported in Fig. |6] where E^i^N^) 
has been estimated, for each N^., using 500 trials of the model. The measured 
slope is close to —1 in agreement with the standard behavior of the estimation 
error: 

em ^° 



A comprehensive study of the statistical properties of the method is out of the 
scope of the present paper and will be addressed in a forthcoming work. 



7 Application to real data 

In this section we apply the main theoretical results we obtained in the previous 
sections to real data. We consider intraday data associated with the most liquid 
maturity of EuroStoxx (FSXE) and Euro-Bund (FGBL) future contracts. The 
data we used are trades at best bid/ask provided by QuantHouse Trading So- 
lution Each time series covers a period of 800 trading days going from 2009 



"^http:/ /www. quanthouse.com 
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May to 2012 September. The typical number of trades is around 40.000 per day 
while the number of mid price changes is 20.000 per day. 

7.1 Kernel matrix estimation 
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Figure 7: Numerical estimation of the Hawkes kernels of EuroStoxx in the 
intraday slot [9 a.m., 11 a.m.] 



In this section we apply the non parametric kernel estimation algorithm 
presented in Section |6] to our data. Since intraday statistics are well known 
to be non stationary, estimations are based on a 2 hours liquid period from 9 
a.m. to 11 a.m. (GMT). The results of the kernel estimation of EuroStoxx are 
displayed in FigsH [8] and H 

One first sees that (j)^''^ is small as compared to 0-^^". This appears more 
clearly on the "zoom" presented in Fig. [5]where we have removed the first point 
in order to get a finer scale. The fact that 0-^''' is larger than cj)^'^ confirms the 
well known strong correlation observed in trade signs (see Section l7.3p . The 
mid-price dynamics is well known to be mainly mean-reverting [3]. This is 
confirmed by Fig. |9] where a greater intensity of the kernel (j)'^''^ as compared 
to 0^^" can be observed. Both kernels c/)-^'" and 4>^''^ are found to decrease as a 
power-law: 

with an exponent /3 ~ 1.2 for 0-^^* and /3 ~ 1.1 for cj)^''^. The cut-off c insures 
the finiteness of the norm of (j) and is found to be smaller than 10~^s while 
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Figure 8: Numerical estimation of the Hawkes kernels 0-^^" (a) and (/'^■'^ (b) in 
the intraday slot [9 a.m., 11 a.m.] for EuroStoxx. This is a "zoom" of Fig. [71 
One sees that (f)^''^ slowly decreases and is very large as compared to (f)^^'^ . 

the values of a are typically between 0.05 and 0.1. This power-law behavior 
is illustrated in Fig. [TU] where we have reported in log-log scale the estimates 
and (Z)^'"^ for both EuroStoxx (• ) and Euro-Bund (solid lines). Let us 
notice that power-law behavior of Hawkes kernels have already been observed 
in [1] using a simple 2D Hawkes model of price jumps. This important property 
suggests the existence of some scale invariance properties underlying the order 
book dynamics. Let us point out that Fig. [TUlalso suggests that the kernels are 
the same for EuroStoxx and Euro-Bund. 

As far as the impact is concerned, one can see in Fig. [7] that only (p^'" is 
significant and well modeled by an impulsive kernel (see Section I2.1.3[) . This 
confirms the fact that a buy (resp. sell) market order increases the probability 
of an upward (resp. downward) movement of the mid-price but only within a 
very small time interval after the trade. The feed-back kernel that accounts for 
the influence of a price move on the trading intensity is also found to be well 
localized but only the cross term is non negligible. It seems that an upward 
(resp. downward) move of the mid price triggers an higher trading activity on 
the bid side (resp. on the ask side). 

7.2 Introducing a model with intraday seasonalities 

In order to check the stationarity of the kernels, wc performed various estima- 
tions on different intraday slices of 2 hours. The results for (f)^''' and t/)^'"^ are 
reported in Fig. [TIT a) and Fig. [TlTb) in the case of EuroStoxx. One can see 
that the shape of the kernels does not depend on the intraday market activity 
and are remarkably stable throughout the day. An estimation of the norms of 
all kernels allows one to estimate the rate /j, through (fT6|) or (fTT]) . In Fig. [TTl c). 
we see that this parameter follows the classical U-shaped intraday curve. It thus 
appears that, in the model, the exogenous intensity of trades fully accounts for 
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Figure 9: Numerical estimation of the Hawkes kernels (/j^^" (a) and (/i^''^ (b) in 
the intraday slot [9 a.m., 11 a.m.] for EuroStoxx. This is a "zoom" of Fig. [71 
One sees that (f)^''^ is larger than 0^'". 



the intraday modulation of market activity. 

It is therefore natural to consider the following model with seasonal varia- 
tions that generalizes, in a particularly simple way, the definition ([7]): 



\t = Mt + ^i^dPt; 



(58) 



with 



( ^* \ 








(59) 



V / 



where is a U-Shaped 1-day periodic function. Let us point out that this is 
particularly elegant and simple way of accounting for an a priori pretty complex 
phenomenon. 



7.3 Trades correlations, Price correlations and Efficiency 

Few years ago, Bouchaud et al. [8 and Lillo and Farmer [TS] independently 
brought evidence that the market order flow is a long memory process. By 
studying the empirical correlation function (in trading time) of the signs of 
trades they have shown, for different markets and assets that: 

C{n) - n'^ (60) 

where n is the lag in number of trades (i.e., using trading time). This scaling 
law remains valid over 2 or 3 decades and the exponent 7 is in the interval 
[0.4,0.7]. The aim of impact price models as described in the introductory 
section (Eq. ^) was to solve this " long- memory" puzzle. Indeed, since trades 
impact prices, if trades are long-range correlated, in order to maintain efficiency. 
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ln(t) ln(t) 

Figure 10: Scaling of Hawkes kernels cf^'^ and (f)^'"^. The kernels are repre- 
sented in double logarithmic representation for Eurostoxx (•) and Euro-Bund 
(solid lines) estimates in the intraday slot [9 a.m., 11 a.m.]. (a) (f)^''^ (b) 0-^'°. 
These plots suggest that the kernels have the same power-law behavior for both 
EuroStoxx and Euro-Bund. 

the price have to respond to market order through a long-memory kernel. This is 
precisely the meaning of the kernel G{j) in ([T]). Bouchaud et al. [8^ have shown 
that, provided the specific shape of G[j) is adjusted as respect to the behavior 
((60|) . trade correlations impact on prices can be canceled (see also [71 for an 
interpretation of this price model in terms of prediction error or "surprise" ) . 

In this section, we explain how this issue can be addressed within our model. 
It is important to point out that, as we will see, the apparent long-range cor- 
relation of the trade signs and the price efficiency will be a consequence of 3 
empirical findings (cf Section 17. 1|) : 

• A ( is power-law 

• is close to 1, i.e., it almost saturates the stability condition A(/)J < 1 

• A'cj)^ < 0, i.e., the price at high frequency is mainly mean-reverting. 

About the long-range memory of the trade sign process The exact 
expression of the autocorrelations of the increments of Ut = — T^^ and 
Xt — — as a function of the lag r can be hardly deduced from (|3ip 
and ([M]) . In order to discuss the shape of theses correlation functions one can 
however use qualitative arguments based on classical Tauberian theorems [12] . 
In what follows the argument z of Laplace transforms is assumed to be real and 
positive (z > 0). 

Let us first remark that, within the range of parameters observed in em- 
pirical data (and notably the relationship A-^ ~ 2A^), one can show that the 
terms involving A0^ both at the numerator and at the denominator in ([34]) are 
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Figure 11: Numerical estimation of the Hawkes kernel matrix (j) for various 2 
hours slices during the day for EuroStoxx. In (a) and (b) 7 estimated kernels 
arc represented in doubly logarithmic scale corresponding to the 7 slots: [8 a.m., 
10 a.m.], [9 a.m.. 11 a.m.], [11 a.m., 1 p.m], [12 a.m., 2 p.m.], [1 p.m., 3 p.m], [2 
p.m., 4 p.m] and [3 p.m., 5 p.m.]. In (a) (p"^'"^ and in (b) cj)^'". One sees that the 
kernel shapes are remarkably stable power-laws across intraday time periods. 
In (c) we reported the estimation of the constant rate as a function of the 
intraday time. One recognizes the classical U-shaped behavior. 



subdominant. In this case, this equation, reads (we neglect the g^^^ factor and 
drop the (h) upper-script everywhere): 



C 



2A^ 



(1-A</'D 



(61) 



Let us note that, according to this expression, the stability condition < 1 
implies that Cq < oo, i.e., the correlation function of the trades is integrable 
and that, strictly speaking, there is no long-range memory in supply and de- 
mand. Let us show however that, under the conditions we observe empirically, 
Cj can reproduce, on a pretty large (though finite) range of r, a slow-decay with 
an exponent smaller than 1, leading various numerical estimations to conclude 
long-range memory. 
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As observed in previous section, ^cfp" is close to a power-law > 0): 

A<?!)f = a(c + t)-'^ (62) 

were (3 — 1 + v is the scaling exponent (empirically we found v ~ 0.2), c is a 
small scale cut-off (empirically c < lO^^s), and a is a factor such that the norm 
A^o" < 1- On has obviously: 

(/3-l)A^ 

If one computes the Laplace transform (with z > 0) of expression (15^ . it is easy 
to show that, in the limit of small z (in practice that means z < c^^) 

A0j^A^(l-r(l-^)(cz)'') 

Thus, according to (|6ip . one has 

Cj ^ , (63) 

(i^^<j,i) + ^(gni^v){czY 

and consequently ~ C — C z^ and therefore, thanks to Tauberian Theorems 
(limit of small z corresponds to limit of large time) : 

However, if we suppose that not only z is small but that A(/)j^ is close enough 
to 1 such that 

1- A^ < A0;^r(l-i/)(cz)'' (64) 

then becomes 
or equivalently 

^ T^'^-i (65) 
Let us point out that the inequality (|64l) holds, as long as: 

z>c-^{ V^c-HO-^ (66) 

using the estimates v ~ 0.2 and A0j^ ~ 0.9. Since z < , this means that 
it can take 5 decades to see the "short range" nature of the trade correlation 
function. Considering that c ~ 10"^s, for an inter event mean time of Is, the 
scaling can extend over 3 decades. Since 1 — 2z/ ~ 0.6, this exponent is 
precisely in the range of values reported empirically in [5] and [12] . 
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About price efficiency From an empirical point of view, in agreement with 
market efficiency hypothesis, it is well known that price variations are almost 
uncorrelated after few seconds. Let us show, that, under the conditions observed 
in empirical data, decreases very fast around t — Q. We provide the same 
kind of pedestrian arguments as in previous discussion: we neglect the influence 
of A(/)^ and suppose that we are in the "impulsive" case of the impact kernel, 
i.e., = I5f Let us remark that, since A-^/A^ is bounded (empirically 

\T I p^N ^ if / is small enough (empirically / < 10~^), the same arguments 
invoked for shows that in the intermediate range: 

one has A^|A^^p < A^|l - A^^p. It results that (|3T]) reduces, in this range, 
to: 

2A^ 



C. 



N 



2 



(1-A0f) 

In order to study the behavior of this function, we use again a power-law ex- 
pression for A(f>^ : 

A(f>^ ^a'{c + t)-^' (67) 

where empirically a' < (and thus A0^ < 0) and f3' = 1 + with i/' ~ 0.1. It 
follows that 

Cf ~ C 

in the range 



J(l-z.')|A0^ 

Since A0^ < 0, |A0^| < 1, T{l-iy') ~ 1, ~ 10, we have 1 « (^^^^l-^g^) 

and the previous inequality always holds. This shows that the price increments 
correlation functions decreases very fastly around zero without any "fine tuning" 
of the kernel shapes. 
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Numerical illustrations All these considerations are illustrated in Fig. [121 
where we have simulated a Hawkes process with the parameters close to the 
ones we found empirically. Let us notice that, in order to mimic the experiments 
performed in [8] and |19j , the correlation functions are computed in trading time, 
i.e., a discrete time which is incremented by 1 at each jump of Ut- We checked 
that setting a lag n in trading time roughly amounts to consider a lag r in 
physical time such that: t — In that respect, the shapes of the correlation 
functions in trading time and in physical time are very close to each other, up 
to a scaling factor. In Fig. [T2\a,) are displayed C^/Cq (•) and /Cq (sohd 
line) as functions of the lag n (in trading time). On clearly sees that is 
slowly decaying while almost vanishes after few lags. In Fig. [T2\h) is 
represented in log-log. As expected, it behaves as power-law with the exponent 
2v —\ (—0.6 in the example we choose) represented by the solid line. 
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Figure 12: Empirical correlation functions of the increments Ut and Xt in trad- 
ing time. The process is a numerical simulation of a Hawkes process where the 
kernels have been chosen to fit the empirical observations found in 17.11 The 
correlation functions where estimated on a single realization containing around 
40.000 market orders, (a) C^/C^ (•) and /C^ (solid line) as functions of 
the lag n (expressed in trading time), (b) in double logarithmic reprensenta- 
tion (•). The solid line represents the power-law fit with expression (j65p where 
2v-l = -0.6. 



7.4 Market impact profile estimation from anonymous data 

As discussed in Section [Sj once one has estimated all the model parameters, 
it is possible to compute the shape of the market impact of some particular 
(meta-) order (Proposition [5] with — 0). If one uses the parameters reported 
previously, one gets a shape of the market impact profile associated with the 
meta-order dAt = T~'^lt<Tdt similar to the one displayed in Fig. HJ^b). One can 
use similar qualitative arguments than in previous section to explain the shape 
observed in Fig. Hljb). Indeed, according to (|^ and using the same notations, 
parameter values (notably the fact that is close to 1) and approximations as 
discussed previously, one can show that in a wide intermediate range of laplace 
parameters z, we have: 

MI, ~ z^A,. (68) 

1 ^ 1 — zT 

If dAt = ItKxdt, then A, = tz'^ — ^^"^ therefore 

MI, - z"'^ if z < T"^ 
MI, - z"-^ if z > T-\ 

It follows that the market impact profile behavior reads: 



MIt - f-" if t:^T 
MIt - t^^" if i < T. 
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that corresponds to the behavior observed in Fig. Sfb). Notice that a strict 
"square-root" would correspond to = 1/2 while it seems we rather observe 
~ 0.2 empirically. 

In general, the determination of the market impact profile is a hard task 
that requires to possess agent labeled (e.g. broker) data. The model presented 
in this paper allows one to recover a market profile using anonymous order flow 
data as explained in Section [5.31 

8 Conclusion and prospects 

From our knowledge, the model we developed in this paper is the first model 
that accounts for both stylized facts of market prices microstructure (including 
random time arrival of price moves, discrete price grid, high frequency mean 
reversion, correlation functions behavior at various time scales) and the stylized 
facts of market impact (mainly the concave/relaxation characteristic shape of 
the market impact of a meta-order) . Analytical closed formula can be obtained 
for most of these stylized facts. Not only it allows us to reveal (through the 
estimations of the different kernels) the dynamics involved between trade arrivals 
and price moves, but it also allows us to estimate the entire market impact profile 
from anonymous market data. 

As far as trade and price dynamics are concerned, we have provided evidence 
of a power-law behavior of the kernels (f^ and (p^ and that the model is close 
to instability (i.e. A(/)(^ is smaller but close to 1). This suggests the existence of 
some self-similarity properties in the order-book dynamics and sharply contrasts 
with the usual exponential kernels used in former parametric Hawkes modeling 
in Finance. The cross-kernels associated with impact of trade on prices (0^) 
and feed-back {(f)^) are well localized in time (i.e. of "impulsive nature"). Thus, 
upward (resp. downward) price moves are mainly impacted by trades on the ask 
(resp. bid) side. In turn, positive (resp. negative) mid-price variations imply 
an increase of the trading intensity on the bid (resp. ask) side. 

Besides these important points, qualitative arguments showed that, provided 
A(/)j^ ~ 1 and < 0, the "long memory puzzle" of the order flow raised by 

Bouchaud et al. [8] can be addressed without any fine tuning of the model 
parameters: trades naturally appear as long-range correlated over a wide range 
of lags while price variations are almost uncorrelated. Moreover, the same kind 
of arguments can explain the concave (" square- root" law) / relaxation typical 
market impact shape and an almost vanishing permanent impact. 

Let us first point out that the model, as is, can be used as a "stochastic 
price replayer" using as an input the true market order arrivals in place of the 
stochastic process Tt . This allows one to "replay" the price of a given historical 
period and, for instance, using it as an input price for any algorithm designed 
to estimate or manage a risk. 

In this paper, we have presented the most basic form of the model. It can be 
seen as a building block for more elaborate models depending on what it is meant 
to be used for. There are many ways for extensions. Let us just go through 



33 



some of them we have aheady developed or we are still working on. For instance, 
it would be important to have a model which not only takes into account the 
arrival times of the market orders but their volumes too. This is a pretty easy 
extension since it can be done within the framework of marked Hawkes processes 
for which straightforward extensions of all computations presented in this paper 
can be obtained. In the simplest form, one could use i.i.d. volumes Vt for the 
market orders : at any time t a market order arrives {dT^ = 1 or dT^T = 1) a 
volume Vt is chosen randomly (using a given law). In its simplest form the new 
model consists in replacing the projection of ([7]) on the last two components by 

^^^i.dNt + 'P'i^fMdTt, 

where / is a function that describes how the volume impacts the price. It 
basically corresponds to what is generally referred into the literature by the 
"instantaneous impact function" . 

In order to go further into the understanding of the underlying dynamics of 
the order-book, a very natural extension of the model, consists in using more 
dimensions in order to take into account limit/cancel orders. Thus, for instance, 

one way would be to introduce a new point process Lt — ^ ^+ ^ where Lf 

(resp. L^) is incremented by 1 whenever a limit order arrives at the best ask 
(resp. best bid) or a cancel order arrives at best bid (resp. best ask). One 
would then need to introduce the different kernels that account for the influence 
of L on r and N and the kernels that account for the influence of T and N 
on L itself. The estimation procedure of the kernels can follow the exact same 
procedure described in Section |6l Along the same line, i.e., by adding new 
dimensions to the 4d-Hawkes model presented in this paper, one could quantify 
the impact of a given exogenous news on the market order flow or directly on 
the price by estimating the corresponding kernel. Or, alternatively, one could 
consider a multi-agent models (e.g., adding 2 dimensions for each agent) and 
model/estimate the influence of a given agent on another one or on all the 
anonymous agents (as does in (l37p V These extended framework would open 
the door to precise estimations and obvious interpretations in order to get better 
insights into to full order book dynamics. 

9 Annexes 

9.1 Stability condition : proof of Proposition [T] 

In this section, we give the proof to the Proposition [T] For the process to be 
stable, we need the eigenvalues of the matrix $o to have a modulus smaller 
than 1 (cf hypothesis (H)). 

Lemma 2 (Eigenvalues of $0) 

If X is an eigenvalue of ^0, then it satisfies 



(a — x){b — x) — c =0 



(69) 
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or 

{a+ - x){b+ - x) - c+ ^ 0, (70) 

where a* = 0^''' ± 0^'^, 6* — 0^'* ± ^^''^ and = (f/'^''' ± 4>q''^){4'1)^ i ?^o''^)- 
fiei us point out that, using Notations\^ a" = A(j)Q , b~ = A(j)Q and c~ = 

Proof 

From p2)) we get 

Since all the matrices are bi-symmetric, they all commute and thus 

det($o - 2^1) = det (($^ - - x\) - $^$^) = (72) 

Moreover, they all diagonalize in the same basis and their eigenvalues are the 
sum and the difference between the self term and the cross term. Thus the 
eigein values of - a;I)($^ - xl) - $^$o satisfy either (jMl) or (fTO]) . 



We now have to study when the modulus of the roots of these equations are 
smaller than 1. 

Lemma 3 (Condition for the roots to have a modulus smaller than 1) 

The roots of the equation 

x? — x(a + 6) + a6 — c = 
have a modulus smaller than 1 iff 
(i) either 

c< -(a -6)^/4 and ab - c < I (73) 

(a) or 

c>-{a-bf/A and |a + 6| < min(l + a6 - c, 2) (74) 

Proof 

The discriminant of this second-order equation is (a — 6)^ + 4c, thus there are 
two cases 

(i) c < — (a — fo)^/4. In this case the roots are complex and conjugated one of 
the other. Thus their modulus is smaller than 1 iff their product is smaller 
than 1, i.e., ab — c < 1 which gives the result for (ii) 
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(ii) c > —(a — hY /A. In this case the roots are real. The condition for them 
to be smaUer than 1 is 

1 - (a + 6) + a& - c > and ^^-^-^ < 1 

and the condition for them to be greater than -1 is 

1 + (a + 5) + a6 - c > and " ^ ^ > -1 
This synthesizes into case ii of Lemma 

Using these two lemmas, let us study the stability condition corresponding to 
and ([TO]) . Before starting, let us note that, since all the (j) functions are 
positive functions (thus all the (t>Q are real positive), it is clear that 

|a"|<a+, \h^\<h+, \cr\<c+. (75) 

Proving (|13p is a necessary condition for (H) to hold. 

For this purpose, we just need to look at the condition for the roots of ([70]) to 
have a modulus smaller than 1. Indeed, ([70)) falls in the case of (ii) of Lemma[3l 
since c = c"^ > 0. Moreover ([7^ is equivalent to |a++6+| < min(l+a+5+— c+, 2) 
which is equivalent to 

c+ < (1 - a+)(l - 6+) and a+, h+ < 1 (76) 

which proves (fT^]) 

Proving (fT5|) is a sufficient condition for (H) to fiold. (and proving 
d) 

For that purpose, we suppose that (jTS]) holds. We want to prove that the roots 
of both and ([70]) have modulus strictly smaller than 1. We have just seen 
this is the case for the roots of ([70]) . we just need to check that it is also the 
case for the roots of ([55)1 . For ([70)1 the case (i) reads 

a-b- - 1< < -{a- - b^f/A 

and the case (ii) (since + < a+ + 5^ < 2 according to ([76]) ') 

-(a" - 6^)^/4 < < 1 + a"6" - + b~\ 

Thus, merging these last two inequations, we get the following condition for the 
roots of to have a modulus strictly smaller than 1 

a"6" - 1 < < 1 + a^fe" - + (77) 

which is nothing but (fT4| . Thus, in order to complete the proof of Proposition 
m we just need to prove that this last inequation holds (i.e., (ITil) holds). 
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Let us first notice that, since \a | < a"*" < 1 and \b | < &^ < 1, one has 
2a~b~ < \a~ + b~\, and consequently 

1 + a"&" - + &~| < 1 - a~b~. 

Since 1 — a~b~ > the following inequation 

|c"| < 1 + a^b^ - + 

is a sufRcient condition for ([77l) to hold. Moreover since |c^| < c^, using ([T3|) . 
a sufRcient condition for ([77l) to hold is 

(1 - a+)(l - 6+) < 1 + a-b- - \a- + b-\. 

which is equivalent to 

a+ + b+ -a+b+ >\a- +b-\-a-b~. (78) 

Since, one has 

a+ + b+-a+b+ = a+{l-b+) + b+ >\a-\{l-b+)+b+ (79) 

> 5+(l-|a-|) + |a-|>|6"|(l-K|) + |a-| (80) 

> |a-| + |6-|-|a-||6-|, (81) 

inequation (I78p is implied by 

\a-\ + |5"| - |a"||6"| > |a" + 6"| - fl-fo". (82) 

Thus, in order to complete the proof of Proposition [l] we just need to prove 
this last inequation. 

In the case a~ and b^ have the same sign, this inequation is a actually a strict 
equality, so it obviously holds. Let us suppose that a" and 5^ do not have the 
same sign. Without loss of generality, we can suppose that a~ < < 6~. We 
distinguish two cases : 

• either < —5^ < 0, in which case 

\a-+b^\-a^b- = -6"(l+a")-a" < 6"(l+a")-a" = |a"| + |5"|-|a"||5"|. 

• or —b~ < a~ < 0, in which case 

\a-+b^\-a-b- = a"(l-6")+6" < -a-(l-&-)+&- = |a-| + |6-|-|a"||6"|. 

9.2 Conditional expectation of a Hawkes process 

In this Section, we establish general results on A^-dimensional Hawkes process 
P = {P^}i<i<N and more particularly on the expectations of dP^ at time t 
conditioned by the fact that the P^ jumped at time 0. We mainly establish 
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two results. The fist one (Prop. [5]) links these conditional expectations with 
the auto-covariance function of dPt and, using previous results [HIl], allows us 
to derive an analytical formula as a function of the kernel. These results are 
used in the Section [57] for characterizing the response function. The second one 
(Prop. [7]) proves that these expectations satisfy a Fredholm system that will be 
used in Section [B] for elaborating a general procedure for the kernel estimation. 

The Hawkes process is defined by its kernel $ — {0*}i<ij<A' and the exoge- 
nous intensity fi = {fJ.^}i<i<N through the equation 

Xt^ fi + (j)*dPt, (83) 

where At is the conditional intensity of P at time t. We consider that the process 
is stable, i.e., the eigenvalues of the matrix $o have modulus smaller than 1. 
We set 

A = E{Xt) = (l-$o) V- (84) 

Finally for all t and V i,j such that 1 < i,j < N, we define gt = {gt'}i<i.j<N 
with 

gl^dt = E{dPl\dP^ = 1) - e'^St - K'dt (85) 

where E{dPl\dPQ = 1) is the conditional expectation of dNl knowing that Nl 
jumps at i = 0, 5i is the dirac distribution and e*-' is always except for i = j for 
which it is equal to 1. Since E{dNl\dN^ = 1) is singular at t = we substracted 
this singular component. In the following gt will refer to the matrix whose 
elements are the gW i.e., 

9t^ {g't}l<^,j<N■ (86) 
We are ready to state our first result. 

Proposition 6 Let Ct.t+r be the infinitesimal covariance matrix (without the 
singular part) as defined by 

Ct,t + T = {Ctft+T}l<4J<JV, 

with 

Cl^t^^dtdT = Cov{dPl+^,dPl) ~ e'^K'Srdt. 
Then gt and Ct^t+r «re linked through the relation 

gr = Ct,t+T^-\ (87) 

Using the analytical formula proved in f^l^ for C^^ ,one gets 

5^ = ^r + S^S"^ + **Svl'J:S"\ (88) 

where = ^'-r and is defined such that (I - $2)"^ = (I + *^). In the 
Fourier domain, this last equation corresponds to 
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Proof of the Proposition. 

This is a pretty straightforward computation : 

C'/t^^dtdT = E{dPi^^dPl) - K'K^dtdT - e'^A'Srdt. 

which gives 

Ci\+^dtdT = E{dPl+^\dPi = l)Prob{dPi 1) - K'K^dtdr ~ e'^A'drdt. 
Using the stationnarity and dividing by dt, one gets 

Cl^t+rdr = E{dP^\dPi = 1)A^ - K'KHt - e'^K'Sr. 
Then, using the definition ([55]) of g"^^ 

Cl^,^,dT = gl^A^dT 
which gives (|57| . From [2 [5] , we know that 

Ct.t+r = + s*^ + * ★ j:¥^, 

This last equation along with ([57]) leads to and then to 
The next result corresponds to the following proposition : 

Proposition 7 Using the notations above, the density gt satisfies the following 
fredholm system of integral equations for positive t 

gt = ^^{5l + gt), Vt>0. (90) 

Moreover, for t < we have 

9t^ - ^glf (91) 

Proof of the Proposition. 

• Proof of (tnOl). We consider t > 

gl' = EidP^dP^, =1)-A = Ei^MP^ = 1) - A' 

Using (1H31), 

N 

9i = /^^ + E '^^^ * E{dPt\dP^ = 1) - A\ 
fc=i 

Using ([SS]) . we get 

AT 

= + ^ 0''= * (5^^" + e^^-^t + A'=) - A\ 
fe=i 
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And consequently 

N N 
fc=l k=l 

However, the vector formulation of + ^k=i 11'^''^ 1 1^*^ ~ nothing but 
A* + (^0 — I) A which is according to (|84| . thus 

N 
k=l 

which gives (|90p . 

Let us note that we could have derived directly the system ((90)) from ([89]) . 
Indeed, ([89]) gives 

(I - = E(I - - I + 

In the time domain, <i>| is supported by M," , thus going back to the time 
domain and restricting to t > directly leads to (|90|) . 



• Proof of ([9T|). Let i < 

E{dPl\dP^ = 1) = Proh{dPl = l|dPj - 1) 



Prob{dP^ = 1) 



Since P is stable, dP is stationary, Prob[dP^ = 1) = Prob{dPl = 1) = A^ , 
thus 

K'E{dPi\dP^ = 1) = Prob{dPl = l,dPi = 1) = E{dP^t\dP^ = 1)A' 
Consequently : 
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